Direct and Indirect Discrimination Prevention Methods

نویسندگان

  • Sara Hajian
  • Josep Domingo-Ferrer
چکیده

Along with privacy, discrimination is a very important issue when considering the legal and ethical aspects of data mining. It is more than obvious that most people do not want to be discriminated because of their gender, religion, nationality, age and so on, especially when those attributes are used for making decisions about them like giving them a job, loan, insurance, etc. Discovering such potential biases and eliminating them from the training data without harming their decision-making utility is therefore highly desirable. For this reason, antidiscrimination techniques including discrimination discovery and prevention have been introduced in data mining. Discrimination prevention consists of inducing patterns that do not lead to discriminatory decisions even if the original training datasets are inherently biased. In this chapter, by focusing on the discrimination prevention, we present a taxonomy for classifying and examining discrimination prevention methods. Then, we introduce a group of pre-processing discrimination prevention methods and specify the different features of each approach and how these approaches deal with direct or indirect discrimination. A presentation of metrics used to evaluate the performance of those approaches is also given. Finally, we conclude our study by enumerating interesting future directions in this research body.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simultaneous Discrimination Prevention and Privacy Protection in Data Publishing and Mining

Data mining is an increasingly important technology for extracting useful knowledge hidden in large collections of data. There are, however, negative social perceptions about data mining, among which potential privacy violation and potential discrimination. The former is an unintentional or deliberate disclosure of a user profile or activity data as part of the output of a data mining algorithm...

متن کامل

Inference Mining using Direct and Indirect Discrimination Prevention in Data Mining

Data Mining is an essential and flourishing technology to extract the relevant and useful information hidden in the large collections of data. Privacy preservation in data mining is an important issue when considering the legal and ethical aspects of data mining. Discrimination is one of the facts that pave the way for negative perceptions in the data mining. Direct and Indirect discrimination ...

متن کامل

Investigation of the performance of budget distribution between different provinces of Iran according to their capacity and needs during the first and last years of the tenth and eleventh government

The existence of economic discrimination in the country can pose a threat to the soft economic threat, and knowing how to measure it and examine its trends across different states can reflect their performance against this phenomenon. In this study, it has been tried to combine the economic discrimination index between the provinces of Iran according to the principles of the Islamic Republic of...

متن کامل

A Causal Framework for Discovering and Removing Direct and Indirect Discrimination

In this paper, we investigate the problem of discovering both direct and indirect discrimination from the historical data, and removing the discriminatory effects before the data is used for predictive analysis (e.g., building classifiers). The main drawback of existing methods is that they cannot distinguish the part of influence that is really caused by discrimination from all correlated infl...

متن کامل

Rule Protection for Indirect Discrimination Prevention in Data Mining

Services in the information society allow automatically and routinely collecting large amounts of data. Those data are often used to train classification rules in view of making automated decisions, like loan granting/denial, insurance premium computation, etc. If the training datasets are biased in what regards sensitive attributes like gender, race, religion, etc., discriminatory decisions ma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013